Making the Shortest-Paths Approach to Sum-of-Pairs Multiple Sequence Alignment More Space Efficient in Practice
Abstract
The MSA program is one of the few existing programs that attempts to find optimal alignments of multiple protein or DNA sequences. The MSA program was written and distributed in 1989. It implements a branch-and-bound technique together with a variant of Dijkstra's shortest paths algorithm to prune the basic dynamic programming graph. We have made substantial improvements in the time and space usage of MSA. The improvements make feasible a variety of problem instances that were not feasible previously. On some runs we achieve an order of magnitude reduction in space usage and a significant multiplicative factor improvement in running time. To explain how the improvements work, it is necessary to give a much more detailed description of MSA than has been previously published. In practice, MSA rarely produces a provably optimal alignment; the reasons for lack of optimality are discussed at length in the body of the paper.
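The idea of combining a shortest-paths search with bound-based pruning can be illustrated with a minimal sketch. It is not the MSA implementation: the function names (`pair_cost`, `column_cost`, `sp_align_cost`), the unit mismatch and gap costs, and the single global `upper_bound` cut are all assumptions made for illustration. The sketch runs Dijkstra's algorithm over the alignment lattice, where each node is a tuple of prefix lengths and each edge is one alignment column scored by its sum-of-pairs cost, and it discards any node whose tentative distance exceeds the bound.

```python
import heapq
from itertools import product

GAP = "-"

def pair_cost(a, b, mismatch=1, gap=2):
    """Hypothetical unit costs; a real scorer would use a substitution matrix."""
    if a == b:
        return 0  # identical residues, or a gap facing a gap
    if a == GAP or b == GAP:
        return gap
    return mismatch

def column_cost(column):
    """Sum-of-pairs cost of a single alignment column."""
    k = len(column)
    return sum(pair_cost(column[i], column[j])
               for i in range(k) for j in range(i + 1, k))

def sp_align_cost(seqs, upper_bound=float("inf")):
    """Cheapest sum-of-pairs alignment cost, or None if none is within the bound.

    Dijkstra's algorithm on the lattice of prefix-length tuples. Nodes whose
    tentative distance exceeds upper_bound are never queued (assumed, simplified
    pruning rule).
    """
    k = len(seqs)
    source = (0,) * k
    sink = tuple(len(s) for s in seqs)
    # Each step advances a nonempty subset of the sequences by one residue.
    steps = [d for d in product((0, 1), repeat=k) if any(d)]
    dist = {source: 0}
    heap = [(0, source)]
    while heap:
        d, node = heapq.heappop(heap)
        if node == sink:
            return d
        if d > dist[node]:
            continue  # stale queue entry
        for step in steps:
            nxt = tuple(p + s for p, s in zip(node, step))
            if any(n > len(seq) for n, seq in zip(nxt, seqs)):
                continue  # would run past the end of a sequence
            column = [seq[p] if s else GAP
                      for seq, p, s in zip(seqs, node, step)]
            new_d = d + column_cost(column)
            if new_d <= upper_bound and new_d < dist.get(nxt, float("inf")):
                dist[nxt] = new_d
                heapq.heappush(heap, (new_d, nxt))
    return None
```

A tighter `upper_bound` keeps fewer lattice nodes in `dist` and in the queue, which is where the space saving comes from; a bound set below the true optimum makes the search fail, so the sketch returns `None` in that case.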
Similar resources
Improving the Practical Space and Time Efficiency of the Shortest-Paths Approach to Sum-of-Pairs Multiple Sequence Alignment
The MSA program, written and distributed in 1989, is one of the few existing programs that attempts to find optimal alignments of multiple protein or DNA sequences. The MSA program implements a branch-and-bound technique together with a variant of Dijkstra's shortest paths algorithm to prune the basic dynamic programming graph. We have made substantial improvements in the time and space usage o...
Improving the Practical Space and Time Efficiency of the Shortest-Paths Approach to Sum-of-Pairs Multiple Sequence Alignment
The MSA program, written and distributed in 1989, is one of the few existing programs that attempts to find optimal alignments of multiple protein or DNA sequences. The MSA program implements a branch-and-bound technique together with a variant of Dijkstra's shortest paths algorithm to prune the basic dynamic programming graph. We have made substantial improvements in the time and space usage of ...
Optimal Alignment of Multiple Sequence Alignments
Chapter 1: Introduction. 1.1 Perspective. 1.2 Contributions. 1.3 Overview ...
Improving the Divide-and-Conquer Approach to Sum-of-Pairs Multiple Sequence
We consider the problem of multiple sequence alignment: given k sequences of length at most n and a certain scoring function, find an alignment that minimizes the corresponding "sum of pairs" distance score. We generalize the divide-and-conquer technique described in [1,2], and present new ideas on how to use efficient search strategies for saving computer memory and accelerating the procedure for ...
An Application of the ABS LX Algorithm to Multiple Sequence Alignment
We present an application of ABS algorithms for multiple sequence alignment (MSA). The Markov decision process (MDP) based model leads to a linear programming problem (LPP), whose solution is linked to a suggested alignment. The important features of our work include the facility of alignment of multiple sequences simultaneously and no limit for the length of the sequences. Our goal here is to ...